Diversity Begets Stability in an Evolving Network 
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Complex evolving systems such as the biosphere, ecosystems and societies exhibit sudden col- 
lapses, for reasons that are only partially understood. Here we study this phenomenon using a 
mathematical model of a system that evolves under Darwinian selection and exhibits the sponta- 
neous growth, stasis and collapse of its structure. We find that the typical lifetime of the system 
increases sharply with the diversity of its components or species. We also find that the prime reason 
for crashes is a naturally occurring internal fragility of the system. This fragility is captured in 
the network organizational character and is related to a reduced multiplicity of pathways between 
its components. This work suggests new parameters for understanding the robustness of evolving 
molecular networks, ecosystems, societies, and markets. 



Crashes in complex systems include mass extinctions 
in the biosphere as evidenced in the paleontological 
record [l[ collapses of ecosystems @], civilizations Q, 
and crashes of stock-markets 0, 0] . The death of a hu- 
man being due to old age is also an example of collapse of 
a once-robust complex system turned fragile. While some 
of these catastrophic events are caused by large external 
perturbations like meteorite impacts, famines, wars and 
infections, for the vast majority of them no single dra- 
matic cause can be traced [l|, [H, Q, HI ■ Here we explore an 
alternative hypothesis that the prime reason for crashes 
is a fragility in the internal organization of these sys- 
tems that naturally develops in the course of their evo- 
lution, making them vulnerable to small perturbations. 
Unfortunately, empirical data characterizing the 'internal 
fragility' or 'robustness' of such systems is scarce. One of 
the chief problems in collecting data is not knowing what 
to look for; we do not know what system parameters can 
characterize its poisedness for a crash. Hence, a key step 
in identifying possible signatures of fragility is to con- 
struct theoretical and mathematical models of systems 
that exhibit repeated catastrophes in the course of their 
time evolution, whose analysis can reveal structural and 
dynamical features that make them vulnerable to such 
events. An important aspect of a complex system's orga- 
nizational structure is the underlying interaction network 
of its components d, 0, H, Q; hence we need in particu- 
lar to study examples of evolving networks that exhibit 
crashes and recoveries. 

Our model system [l(| EH exhibits these phenomena 
for an evolving network of interacting populations, with 
Darwinian selection and dynamic feedback loops play- 
ing an important role in system evolution. The system 
consists of s nodes, whose network of interactions is spec- 
ified completely by its adjacency matrix C = (cij),i,j = 
1, . . . , s. A node may represent a molecular species in 
a prebiotic pond. The model is motivated by the origin 
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of life problem [13, El El, but may be more generally 
valid. The element cy = 1 if species j 'catalyzes' the 
growth of species i, and zero otherwise. Also, cu — 
for all i, corresponding to the exclusion of self-catalyzing 
species. Relaxing the above restrictions by allowing links 
with different weights and negative signs does not change 
the qualitative behaviour of the model. 

Using the adjacency matrix, we write an equation for 
the population dynamics of the species given by 

s 

in = ^2cijVj - <ka- (i) 

3=1 

Here, yi is the rate of change of the population of species 
i. The first term on the right takes into account the 
positive effect of all the species that catalyze species i, 
each one having an effect proportional to its population. 
The second term is a constant mortality term. 

Cast in terms of the relative populations, Xi = 
Vi/Yfi=i Vh Eq.EQ implies that 

s s 

%i = ^ ^ij^j ^ j £kj%j- (2) 

3 = 1 k,j=l 

The dynamics described by Eq. [3] flows to a fixed point in 
which all Xi become time independent constants. Techni- 
cally, this steady state is just an eigenvector of the matrix 
C corresponding to its largest eigenvalue. For a generic 
non-negative matrix C, it is a unique, global attractor 
(independent of initial conditions), stable against per- 
turbations of the Xi . 

Initially, the matrix C is sparse and drawn from the 
random binomial ensemble with on average m links per 
node (with m < 1). To introduce evolution into the 
model, we note that the pond can be washed by nearby 
tides, floods or storms that can flush out some of the 
contents of the pond. We use the Darwinian dictum 
'Survival of the fittest ' and impose that the species with 
the lowest relative population in the steady state gets 
removed from the system [ToT ]: we eliminate the corre- 
sponding node along with all its links from the graph. 
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FIG. 1: Number of populated species si as a function of 
time. The total number of species s = 100, 300, 500 for the 
three traces from top to bottom, while m = 0.25 for each. 
The number of crashes decrease markedly with increasing s. 
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FIG. 2: Organized state lifetime as a function of s on a semi- 
log plot for h = 0.75 and various values of m denoted by 
□ : m — 0.15, o : m — 0.25, A : m — 0.35. The straight lines 
are least square fits whose slopes gives a(m). 



(If there are more than one such species, we choose one 
at random.) Furthermore, such a fluctuation can bring 
in new species into the pond; we assume for simplicity 
that a single new node gets added to the graph whose 
links with the existing ones are made randomly with the 
same average connectivity to. After each such fluctua- 
tion, the populations evolve according to Eq. [2] with a 
fixed C to reach a new fixed point, whereafter the above 
update sequence is repeated. 

At each ('Darwinian') update, then, the system suffers 
a structural perturbation that modifies C. The perturba- 
tion is small in that only one species is updated, affecting 
the links of only ~ to ~ 0(1) number of species. Since 
the update of C depends on populations, the long time 
dynamics of the populations is highly nonlinear inspite 
of the simplicity of Eqs. Q] and O The typical dynamics 
is shown in Fig. 1 where the number, si, of populated 
species (whose steady state Xi > 0) is plotted against 
time (number of graph updates, n) for three values of 
s = 100,300,500 and fixed m = 0.25. We scale time by 
1/s as each species, on average, can be updated in s steps. 
Initially, the graph is sparse and the number s% is small. 



After a certain time, Si begins to grow and soon reaches 
its maximum value s. Thereafter, the system exhibits a 
stasis for a certain time scale, r, in which s± fluctuates 
between s and s — 1. In this state, which we call the 'or- 
ganized state', all species except possibly the one being 
picked for replacement have Xi > 0. Thereafter, the sys- 
tem experiences a collapse in which s± drops to a fraction 
of s. This is followed by a recovery and a repetition of 
the same kind of dynamics. This behaviour is discussed 
in detail in 0,113 • 

In this letter, we focus on crashes whereby, in a sin- 
gle update step, the number of populated species Si goes 
from s to a fraction h of s. We present results for h — 0.50 
and 0.75. While the absolute number of crashes depends 
upon h, the qualitative results are not very sensitive to 
its value. As shown in Fig. 1, for fixed m, the frequency 
of crashes comes markedly down with increasing s. Sim- 
ilarly, if we increase to for fixed s, the number of crashes 
again decreases markedly. 

For given values of m and s, there is a typical lifetime 
before the network collapses. We define this time r as 
the number of update steps spent in the organized state 
in a given run (typically 10 6 steps long), divided by the 
number of crashes observed during that run. Each run is 
parameterized by s and to and crashes are defined with 
respect to the parameter h. Hence r depends upon s,m, 
and h. The dependence of r on s and m is shown in Fig. 
2, for h = 0.75. For fixed to, t/s grows exponentially 
with s. This behavior is consistent with the empirical 
relation 

- = A(m,h)e a(m - h)s . (3) 
s 

The coefficient a (to, h) is an increasing function of to 
(and a weak function of h) whose quantitative behavior 
is discussed later. 

These results show that the system is more stable 
against crashes as its diversity, s, increases for fixed con- 
nectivity, to, and also as its connectivity, to, increases for 
a fixed diversity. We emphasize that even for low con- 
nectivity the system can be stabilized against collapse by 
increasing its diversity. It turns out that in the organized 
state, the average connectivity of the species is close to 
~ 1 + to; hence, for the values of to given above the av- 
erage connectivity is only slightly above one. Even such 
sparsely connected systems are stabilized in this model 
by a sufficient amount of diversity. 

We now attempt to understand this behaviour in terms 
of the structure of the graph near and far from a crash. 
The organized state has the structure of an autocatalytic 
set (ACS). An ACS is a subgraph, each of whose nodes 
has at least one inco ming link from a node belonging 
to the same subgraph [13|]. In the organized state, all 
the species except possibly the one being picked for re- 
placement are part of the ACS [l(| • The ACS consists 
of a core and a periphery. The core comprises the set of 
nodes (along with their mutual links) from which there 
is a directed path to every other node in the ACS. All 
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FIG. 4: Normalized frquency distribution of closed paths in 
the core across the sample of all time steps in the organized 
state, with logarithmic (base 2) binning, for m = 0.25 and 
s = 500 and solid line). Similar distribution but across 
the smaller sample of time steps just preceding crashes (O 
and dashed line). The curves are least square fits to the data 
using a normal distribution. 



FIG. 3: Examples of network configurations for m = 0.25. 
(a,c): in the normal organized state (far away from a crash) 
and (b,d): in the organized state just before a crash. Core 
nodes are shown in red, periphery in blue. For s=300, pe- 
riphery nodes are not shown to avoid clutter. Note (i) that 
the core is large and has multiple directed pathways between 
any pair of its nodes in the normal organized state (a,c). Just 
before a crash it becomes smaller and does not possess mul- 
tiplicity of pathways (b,d). (ii) As one increases s from 100 
to 300, the number of multiple pathways in the core in the 
normal organized state increases dramatically (compare a and 
c), while in the state poised for a crash it is more or less the 
same and quite small (compare b and d). 



other nodes and links in the ACS constitute the periph- 
ery. Examples of the graph (ACS with core and periph- 
ery) observed in the organized state are shown in Fig. 3. 
By definition there is no directed path from a periphery 
node to any core node. The core, by virtue of closed paths 
inside it, is a 'self-sustaining' structure in the sense that 
all the core nodes would be populated even if the only 
links present in the graph are those in the core. In con- 
trast, the periphery nodes would become depopulated if 
the links from the core to the periphery were to be re- 
moved. In this sense, the periphery nodes are 'parasites' 
that are sustained by the core. 

While there is always by definition at least one path 
from every core node to every other core node, the num- 
ber of such paths is significantly different between a nor- 
mal organized state and a state poised for a crash. In the 
typical organized state there are several paths from each 
core node to another (see Figs. 3(a,c). In such configu- 
rations, no single node addition or deletion can cause a 
crash. However, the number of paths between core nodes 
drops to a much lower value just before a crash (see Figs. 
3(b,d)). Then, a single node change can disrupt the core 
and cause most network species to be depopulated. 
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FIG. 5: ln(Np) vs. s for various values of m denoted by 
symbols □ : m = 0.15, Q : m = 0.25, A : m = 0.35. The 
straight lines are least square fits whose slopes give f3(m). 

In Fig. 4, we plot the frequency distribution of distinct, 
non-intersecting closed paths of all lengths in the core 
in the organized state (filled circles). The distribution 
shows a peak whose position, N p , is dependent upon s 
and to. A plot of ln(N p ) against s for various values of m 
is shown in Fig. 5. This is consistent with the empirical 
formula 

N p = B(m)e^ m >. (4) 

We note that loops in other graph ensembles have also 
been counted [HI ]. 

In Fig. 4, the open symbols show the distribution of 
closed paths in the core just before crashes. Its peak 
occurs at a much smaller value than N p (note that the 
x-axis scale is logarithmic). This is also evident from Fig. 
3 (the cores in (b) and (d) have much fewer closed paths 
than in (a) and (c)). 
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FIG. 6: A double logarithmic plot of coefficients a(m) and 
/3(m) as functions of m. Symbols used are □ : a(m) for 
h = 0.50, A : a(m) for ft = 0.75, and Q : /9(m). 



We find a strong correlation between the coefficients 
a(m) and (3(m). This is shown in Fig. 6 where a(m, ft) 
for two values of ft and (3(rri) are plotted against m. It is 
seen that the dependence of a on ft is weak, as mentioned 
before, and that a and f3 have a similar dependence upon 
to. Thus N p and r/s have a similar dependence on m and 
s. This close correspondence between a structural prop- 
erty like the number of loops in the graph in the organized 
phase and a dynamical property like the lifetime of that 
phase, is one of the surprising results we have found. 

This suggests an explanation of why a higher diversity 
and density of links enhances stability against crashes 
in this model. Diversity increases the number of closed 
paths in the core and thus provides a buffer against 
crashes by ensuring alternate routes of sustenance in the 
event of loss of core nodes. Crashes occur typically when 
the core has thinned out, and such fragile states take 
longer to be realized when there is a larger number of 
paths in the core to begin with. 

As in real evolutionary systems, the model generates 
several dynamical time scales. The model has only two 
parameters: system size or diversity, s, and the average 
connectivity of a new node, m (the latter being typically 
O(l)). In spite of its extreme simplicity, the time scales 
that dynamically appear have a wide range of dependence 
on s, including logarithmic, power law and exponential. 
The time scale for the appearance of an ACS is indepen- 
dent of s and of its growth is ~ In s [1(1 HH ( a t constant 
77i, in scaled units of time as used in Fig. 1. Once a crash 
sets in it occurs fast - on a time scale ~ 1/s in the present 
version of the model. The fast collapse and a relatively 
slower growth seen in the model is an observed feature 
in fossil records as well as stock markets. The lifetime of 
the system between its growth and collapse has turned 
out to be the timescale that is the most sensitive to its 
diversity, namely, ~ e as , as shown here. Such a depen- 
dence means that there is a threshold scale of diversity 
set by 1/a, such that if diversity is well above this scale 
the system is robust to crashes, but if it is close to or 
lower it is vulnerable. 



The dynamics of growth and collapse in our model 
is different from other existing models, including vari- 
ous models of extinction studied in the literature (see 
the review [l9[ and references therein). The seed for the 
growth of complexity in this model is a small feedback 
loop (usually a 2-cycle) that arises in the network by 
chance. The cooperativity implicit in this autocatalytic 
structure causes its nodes to have much higher popu- 
lations than other nodes. Under a selection dynamics 
that preferentially preserves nodes with higher popula- 
tion, such a structure is stable and grows in complexity 
until it spans the whole system. Then, the same selection 
dynamics causes its components, erstwhile cooperators, 
to become competitors. This leads to internal organiza- 
tional restructuring, and, on a certain timescale, when 
the internal feedback loops become sparse, to fragility. 
Thus, we have here an example of how the very suc- 
cess and domination of a certain organizational struc- 
ture changes the effective rules of the game leading to 
the collapse of the structure (for another such example 
see the model poj). This is reminiscent of how certain 
civilizations and organizations collapse in the real world 
[3(. The role of feedback loops in a network structure 
that evolves under both selection and stochastic forces is 
also characteristic of several real evolutionary systems. 

Mathematical models of ecosystems suggest that a 
large number of complex factors determine ecosystem 
stability under various types of perturbations (see the 
reviews [U HH, [23|, HiJ and references therein). The im- 
portance of the multiplicity of sustenance pathways of 
species, suggested by MacArthur (25[, is analogous to 
the result we have found above. Note that as in the core 
of our graphs, so in ecosystems at the most basic level 
there exist several feedback loops between plants and mi- 
crobial communities that feed on detritus and restore soil 
nutrients. These self-sustaining parts of the ecosystem 
are probably more primitive, and their dynamics rela- 
tively independent of the more 'peripheral' higher trophic 
levels that they support. Disruption of these feedback 
pathways would, beyond a certain point, be catastrophic 
for the ecosystem as a whole. Most ecosystem mod- 
els concerned with stability typically take into account 
only the plants and higher trophic levels and exclude mi- 
crobes that provide essential feedback loops. Our work 
suggests that newer and perhaps clearer patterns may 
emerge when models and field data are considered that 
include microbes along with other trophic levels. 

A point of caution is that Eqs. (1) and (2), motivated 
by catalytic chemical production, would need to be mod- 
ified to represent other systems, e.g., the population dy- 
namics in ecosystems. Nevertheless, it is worthwhile to 
note that several qualitative features of the above model, 
including self-organization and collapse of the network, 
are preserved when we include negative links (that inhibit 
species production) and links with varying strengths, and 
where the network size s is itself a dynamical variable 
with its upper limit statistically determined by a relative 
population threshold below which species are eliminated 
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[l6l. [26. 27]. The determination of system lifetime before 
a crash and the core architecture in these computation- 
ally more demanding versions of the model is presently 
under study and will be reported elsewhere. Needless 
to say, the real causes of fragility would be many and 
varied for different systems: organisms, ecosystems, soci- 
eties, etc. It would be interesting to explore more realistic 
models exhibiting crashes and recoveries to see the extent 
to which they share the behaviour of our simple idealized 
model. 

As in several real world systems, an impending collapse 
is not visible in an obvious way beforehand in the model. 
The fragility of the system is directly visible only if one 
looks at the organizational structure, or the network, and 
observes the internal multiplicity of core pathways (see, 
e.g., Fig. 3). Thus, it may not be enough to quantify 
populations of species in ecosystems, or stock prices of 
companies in markets, or the performance of individual 
organs in an aging human body. One may need more 
'systemic' information about the network of interaction 
among the components and the analysis of internal path- 
ways to assess the true health of these systems. 

A notion of robustness of a complex system to the re- 
moval of nodes has been defined in ref. [281 ] in terms of the 
increase of the network diameter. Our approach is differ- 



ent in that we do not define system robustness in terms 
of a network property, but rather find that its robustness 
as measured by the time interval between crashes is cor- 
related with a network property - the number of closed 
paths in its core. 

It is important to note that our model is concerned pri- 
marily not with stability under perturbations of popula- 
tions in a random network, as discussed by May [23, [3(| , 
but with structural perturbations of node/link removal 
and introduction that arise in the natural course of evo- 
lution in a highly self organized network. It shows how 
an increase in diversity and link density can contribute 
to long term system stability against crashes caused by 
such perturbations by increasing the cooperative routes 
of sustenance in the network. 
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